Certified Data Analyst Associate v1.0

Page:    1 / 6   
Exam contains 85 questions

Which of the following is a benefit of Databricks SQL using ANSI SQL as its standard SQL dialect?

  • A. It has increased customization capabilities
  • B. It is easy to migrate existing SQL queries to Databricks SQL
  • C. It allows for the use of Photon's computation optimizations
  • D. It is more performant than other SQL dialects
  • E. It is more compatible with Spark's interpreters


Answer : B

How can a data analyst determine if query results were pulled from the cache?

  • A. Go to the Query History tab and click on the text of the query. The slideout shows if the results came from the cache.
  • B. Go to the Alerts tab and check the Cache Status alert.
  • C. Go to the Queries tab and click on Cache Status. The status will be green if the results from the last run came from the cache.
  • D. Go to the SQL Warehouse (formerly SQL Endpoints) tab and click on Cache. The Cache file will show the contents of the cache.
  • E. Go to the Data tab and click Last Query. The details of the query will show if the results came from the cache.


Answer : A

Which of the following statements about a refresh schedule is incorrect?

  • A. A query can be refreshed anywhere from 1 minute to 2 weeks.
  • B. Refresh schedules can be configured in the Query Editor.
  • C. A query being refreshed on a schedule does not use a SQL Warehouse (formerly known as SQL Endpoint).
  • D. A refresh schedule is not the same as an alert.
  • E. You must have workspace administrator privileges to configure a refresh schedule.


Answer : C

A data analyst creates a Databricks SQL Query where the result set has the following schema: region STRING number_of_customer INT
When the analyst clicks on the "Add visualization" button on the SQL Editor page, which of the following types of visualizations will be selected by default?

  • A. Violin Chart
  • B. Line Chart
  • C. Bar Chart
  • D. Histogram
  • E. There is no default. The user must choose a visualization type.


Answer : C

A data analyst has created a Query in Databricks SQL, and now they want to create two data visualizations from that Query and add both of those data visualizations to the same Databricks SQL Dashboard.
Which of the following steps will they need to take when creating and adding both data visualizations to the Databricks SQL Dashboard?

  • A. They will need to alter the Query to return two separate sets of results.
  • B. They will need to add two separate visualizations to the dashboard based on the same Query.
  • C. They will need to create two separate dashboards.
  • D. They will need to decide on a single data visualization to add to the dashboard.
  • E. They will need to copy the Query and create one data visualization per query.


Answer : B

A data analyst has been asked to provide a list of options on how to share a dashboard with a client. It is a security requirement that the client does not gain access to any other information, resources, or artifacts in the database.
Which of the following approaches cannot be used to share the dashboard and meet the security requirement?

  • A. Download the Dashboard as a PDF and share it with the client.
  • B. Set a refresh schedule for the dashboard and enter the client's email address in the "Subscribers" box.
  • C. Take a screenshot of the dashboard and share it with the client.
  • D. Generate a Personal Access Token that is good for 1 day and share it with the client.
  • E. Download a PNG file of the visualizations in the dashboard and share them with the client.


Answer : D

A data analyst has been asked to produce a visualization that shows the flow of users through a website.
Which of the following is used for visualizing this type of flow?

  • A. Heatmap
  • B. Choropleth
  • C. Word Cloud
  • D. Pivot Table
  • E. Sankey


Answer : E

An analyst writes a query that contains a query parameter. They then add an area chart visualization to the query. While adding the area chart visualization to a dashboard, the analyst chooses "Dashboard Parameter" for the query parameter associated with the area chart.
Which of the following statements is true?

  • A. The area chart will use whatever is selected in the Dashboard Parameter while all or the other visualizations will remain changed regardless of their parameter use.
  • B. The area chart will use whatever is selected in the Dashboard Parameter along with all of the other visualizations in the dashboard that use the same parameter.
  • C. The area chart will use whatever value is chosen on the dashboard at the time the area chart is added to the dashboard.
  • D. The area chart will use whatever value is input by the analyst when the visualization is added to the dashboard. The parameter cannot be changed by the user afterwards.
  • E. The area chart will convert to a Dashboard Parameter.


Answer : B

A data analyst has been asked to configure an alert for a query that returns the income in the accounts_receivable table for a date range. The date range is configurable using a Date query parameter.
The Alert does not work.
Which of the following describes why the Alert does not work?

  • A. Alerts don't work with queries that access tables.
  • B. Queries that return results based on dates cannot be used with Alerts.
  • C. The wrong query parameter is being used. Alerts only work with Date and Time query parameters.
  • D. Queries that use query parameters cannot be used with Alerts.
  • E. The wrong query parameter is being used. Alerts only work with dropdown list query parameters, not dates.


Answer : D

Which of the following statements about adding visual appeal to visualizations in the Visualization Editor is incorrect?

  • A. Visualization scale can be changed.
  • B. Data Labels can be formatted.
  • C. Colors can be changed.
  • D. Borders can be added.
  • E. Tooltips can be formatted.


Answer : D

A data team has been given a series of projects by a consultant that need to be implemented in the Databricks Lakehouse Platform.
Which of the following projects should be completed in Databricks SQL?

  • A. Testing the quality of data as it is imported from a source
  • B. Tracking usage of feature variables for machine learning projects
  • C. Combining two data sources into a single, comprehensive dataset
  • D. Segmenting customers into like groups using a clustering algorithm
  • E. Automating complex notebook-based workflows with multiple tasks


Answer : C

A data organization has a team of engineers developing data pipelines following the medallion architecture using Delta Live Tables. While the data analysis team working on a project is using gold-layer tables from these pipelines, they need to perform some additional processing of these tables prior to performing their analysis.
Which of the following terms is used to describe this type of work?

  • A. Data blending
  • B. Last-mile dashboarding
  • C. Data testing
  • D. Last-mile ETL
  • E. Data enhancement


Answer : D

Which of the following statements describes descriptive statistics?

  • A. A branch of statistics that uses summary statistics to quantitatively describe and summarize data.
  • B. A branch of statistics that uses a variety of data analysis techniques to infer properties of an underlying distribution of probability.
  • C. A branch of statistics that uses quantitative variables that must take on a finite or countably infinite set of values.
  • D. A branch of statistics that uses summary statistics to categorically describe and summarize data.
  • E. A branch of statistics that uses quantitative variables that must take on an uncountable set of values.


Answer : A

In which of the following situations will the mean value and median value of variable be meaningfully different?

  • A. When the variable contains no outliers
  • B. When the variable contains no missing values
  • C. When the variable is of the boolean type
  • D. When the variable is of the categorical type
  • E. When the variable contains a lot of extreme outliers


Answer : E

A data analyst is working with gold-layer tables to complete an ad-hoc project. A stakeholder has provided the analyst with an additional dataset that can be used to augment the gold-layer tables already in use.
Which of the following terms is used to describe this data augmentation?

  • A. Data testing
  • B. Ad-hoc improvements
  • C. Last-mile dashboarding
  • D. Last-mile ETL
  • E. Data enhancement


Answer : E

Page:    1 / 6   
Exam contains 85 questions

Talk to us!


Have any questions or issues ? Please dont hesitate to contact us

Certlibrary.com is owned by MBS Tech Limited: Room 1905 Nam Wo Hong Building, 148 Wing Lok Street, Sheung Wan, Hong Kong. Company registration number: 2310926
Certlibrary doesn't offer Real Microsoft Exam Questions. Certlibrary Materials do not contain actual questions and answers from Cisco's Certification Exams.
CFA Institute does not endorse, promote or warrant the accuracy or quality of Certlibrary. CFA® and Chartered Financial Analyst® are registered trademarks owned by CFA Institute.
Terms & Conditions | Privacy Policy